During the last decades, classical models in language theory have been extended
by control mechanisms defined by monoids. We study which monoids
cause the extensions of context-free grammars, finite automata, or finite state 
transducers to exceed the capacity of the original model. Furthermore, we 
investigate when, in the extended automata model, the nondeterministic vari- 
ant differs from the deterministic one in capacity. We show that all these 
conditions are in fact equivalent and present an algebraic characterization. 
In particular, the open question of whether every language generated by a va- 
lence grammar over a finite monoid is context-free is provided with a positive 
answer. 

1 Introduction 

The idea to equip classical models of theoretical computer science with a monoid (or a 
group) as a control mechanism has been pursued by several authors in the last decades 
[FS02, IMVM01, Kam09, MS01, Pau80, RK09]. This interest is justified by the fact
that these extensions allow for a uniform treatment of a wide range of automata and 
grammar models: Suppose a storage mechanism can be regarded as a set of states on 
which a set of partial transformations operates and a computation is considered valid 
if the composition of the executed transformations is the identity. Then, this storage 
constitutes a certain monoid control. 

For example, in a pushdown storage, the operations push and pop (for each partic- 
ipating stack symbol) and compositions thereof are partial transformations on the set 
of words over some alphabet. In this case, a computation is considered valid if, in the 
end, the stack is brought back to the initial state, i.e., the identity transformation has 
been applied. As further examples, blind and partially blind multicounter automata (see 
[Gre78]) can be regarded as finite automata controlled by a power of the integers and of
the bicyclic monoid (see [RK09]), respectively.

Another reason for studying monoid controlled automata, especially in the case of
groups, is that the word problems of a group G are contained in a full trio (such as the 
context-free or the indexed languages) if and only if the languages accepted by valence 
automata over G are contained in this full trio (see, for example, [Kam09, Proposition
2]). Thus, valence automata offer an automata theoretic interpretation of word problems 
for groups. 

A similar situation holds for context-free grammars where each production is assigned 
a monoid element such that a derivation is valid as soon as the product of the monoid 
elements (in the order of the application of the rules) is the identity. Here, the integers, 
the multiplicative group of Q, and powers of the bicyclic monoid lead to additive and 
multiplicative valence grammars and Petri net controlled grammars, respectively. The 
latter are in turn equivalent to matrix grammars (with erasing and without appearance
checking, see [DT09] for details). Therefore, the investigation of monoid control
mechanisms promises very general insights into a variety of models. 

One of the most basic problems for these models is the characterization of those 
monoids whose use as control mechanism actually increases the power of the respective 
model. For monoid controlled automata, such a characterization has been achieved by 
Mitrana and Stiebe [MS01] for the case of groups, but has not been established for
monoids. For valence grammars, that is, context-free grammars with monoid control,
very little has been known in this respect so far. It was an open problem whether
valence grammars over finite monoids are capable of generating languages that are not
context-free (see [FS02, p. 387]).

Another important question is for which monoids the extended automata can be de- 
terminized, that is, for which monoids the deterministic variant is as powerful as the 
nondeterministic one. Mitrana and Stiebe [MS01] have shown that automata controlled
by a group cannot be determinized if the group contains at least one element of infinite 
order. However, the exact class of monoids for which automata can be determinized was 
not known to date. 

The contribution of this work is twofold. On the one hand, the open question of 
whether all languages generated by valence grammars over finite monoids are context- 
free is settled affirmatively. On the other hand, we present an algebraic dichotomy 
of monoids that turns out to provide a characterization for all the conditions above. 
Specifically, we show that the following assertions are equivalent: 

• Valence grammars over M generate only context-free languages. 

• Valence automata over M accept only regular languages. 

• Valence automata over M can be determinized. 

• Valence transducers over M perform only rational transductions. 

• In each finitely generated submonoid of M, only finitely many elements possess a 
right inverse. 

2 Basic notions

A monoid is a set M together with an associative operation and a neutral element. 
Unless defined otherwise, we will denote the neutral element of a monoid by 1 and its 
operation by juxtaposition. That is, for a monoid $M$ and $a, b \in M$, $ab \in M$ is their
product. The opposite monoid $M^{\mathrm{op}}$ of $M$ has the same set of elements as $M$, but has
the operation $\circ$ with $a \circ b := ba$, $a, b \in M$. For $a, b \in M$, we write $a \sqsubseteq b$ iff there are
$c, d \in M$ such that $b = ac = da$. Let $a \in M$. An element $b \in M$ with $ab = 1$ is called
a right inverse of $a$. If $b \in M$ obeys $ba = 1$, it is a left inverse of $a$. An element that
is both a left and a right inverse is said to be a two-sided inverse. By $\mathbf{1}$, we denote
the trivial monoid that consists of just one element. $M$ is said to be left-cancellative if
$ab = ac$ implies $b = c$ for $a, b, c \in M$. Whenever $M^{\mathrm{op}}$ is left-cancellative, we say that $M$
is right-cancellative.

A subset $N \subseteq M$ is said to be a submonoid of $M$ iff $1 \in N$ and $a, b \in N$ implies
$ab \in N$. For a subset $N \subseteq M$, let $\langle N \rangle$ be the intersection of all submonoids $N'$ of $M$
that contain $N$. That is, $\langle N \rangle$ is the smallest submonoid of $M$ that contains $N$. $\langle N \rangle$ is
also called the submonoid generated by $N$. We call a monoid finitely generated if it is
generated by a finite subset. In each monoid $M$, we have the following submonoids:

\[ \mathfrak{R}(M) := \{a \in M \mid \exists b \in M : ab = 1\}, \]
\[ \mathfrak{L}(M) := \{a \in M \mid \exists b \in M : ba = 1\}. \]

The elements of $\mathfrak{R}(M)$ and $\mathfrak{L}(M)$ are called right invertible and left invertible,
respectively. In addition, for every element $a \in M$, we define the sets

\[ \overrightarrow{\mathfrak{I}}(a) := \{b \in M \mid ab = 1\}, \]
\[ \overleftarrow{\mathfrak{I}}(a) := \{b \in M \mid ba = 1\}. \]

When using a monoid $M$ as part of a control mechanism, the subset

\[ \mathfrak{E}(M) := \{a \in M \mid \exists b, c \in M : bac = 1\} \]

will play an important role. If in $M$ every element has a two-sided inverse, we call $M$ a
group.
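For a finite monoid, the sets of right invertible elements, left invertible elements, and elements a with bac = 1 for some b, c can be computed by brute force. The following is a minimal sketch; the function name `invertible_sets` and the two example monoids (the integers modulo 6 under multiplication and the maps on a three-element set under composition) are illustrative assumptions, not taken from the text.

```python
from itertools import product

def invertible_sets(elements, mul, one):
    """Return (R, L, E) for a finite monoid given by elements, product, and identity."""
    elements = list(elements)
    # R: elements a with a*b = 1 for some b (right invertible).
    R = {a for a in elements if any(mul(a, b) == one for b in elements)}
    # L: elements a with b*a = 1 for some b (left invertible).
    L = {a for a in elements if any(mul(b, a) == one for b in elements)}
    # E: elements a with b*a*c = 1 for some b, c.
    E = {a for a in elements
         if any(mul(mul(b, a), c) == one for b, c in product(elements, repeat=2))}
    return R, L, E

# Example 1 (assumed for illustration): integers modulo 6 under multiplication.
R, L, E = invertible_sets(range(6), lambda a, b: (a * b) % 6, 1)

# Example 2 (assumed for illustration): all maps {0,1,2} -> {0,1,2} under
# composition; a map f is stored as the tuple (f(0), f(1), f(2)).
maps = list(product(range(3), repeat=3))
compose = lambda f, g: tuple(g[f[i]] for i in range(3))  # apply f, then g
Rt, Lt, Et = invertible_sets(maps, compose, (0, 1, 2))
```

In both examples the three sets coincide and form a group (the units modulo 6, and the six permutations, respectively), which is the behaviour one expects from a finite monoid.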

Let S be a fixed countable set of abstract symbols, the finite subsets of which are
called alphabets. For an alphabet $X$, we will write $X^*$ for the set of words over $X$.
The empty word is denoted by $\lambda \in X^*$. In particular, $\emptyset^* = \{\lambda\}$. Together with the
concatenation as its operation, $X^*$ is a monoid. We will regard every $x \in X$ as an
element of $X^*$, namely the word consisting only of one occurrence of $x$. For a symbol
$x \in X$ and a word $w \in X^*$, let $|w|_x$ be the number of occurrences of $x$ in $w$. For a
subset $Y \subseteq X$, let $|w|_Y := \sum_{x \in Y} |w|_x$. By $|w|$, we will refer to the length of $w$. By
$X^{\le n} \subseteq X^*$, for $n \in \mathbb{N}$, we denote the set of all words over $X$ of length $\le n$. Given
alphabets $X, Y$, subsets of $X^*$ and $X^* \times Y^*$ are called languages and transductions,
respectively. We define the shuffle $L_1 \sqcup\!\sqcup L_2$ of two languages $L_1, L_2 \subseteq X^*$ to be the set
of all words $w \in X^*$ such that $w = u_1 v_1 \cdots u_n v_n$ for some $u_i, v_i \in X^*$, $1 \le i \le n$, with
$u_1 \cdots u_n \in L_1$, $v_1 \cdots v_n \in L_2$. When $\{w\}$ is used as an operand for $\sqcup\!\sqcup$, we also just write
$w$ instead of $\{w\}$. For $x_1, \ldots, x_n \in X$, let $(x_1 \cdots x_n)^{\mathrm{rev}} := x_n \cdots x_1$.

Let $M$ be a monoid. An automaton over $M$ is a tuple $A = (M, Q, E, q_0, F)$, in which
$Q$ is a finite set of states, $E$ is a finite subset of $Q \times M \times Q$, called the set of edges,
$q_0 \in Q$ is the initial state, and $F \subseteq Q$ is the set of final states. The step relation $\Rightarrow_A$ of
$A$ is a binary relation on $Q \times M$, for which $(p, a) \Rightarrow_A (q, b)$ iff there is an edge $(p, c, q)$
such that $b = ac$. The set generated by $A$ is then

\[ S(A) := \{a \in M \mid \exists q \in F : (q_0, 1) \Rightarrow_A^* (q, a)\}. \]

A valence automaton over $M$ is an automaton $A$ over $X^* \times M$, where $X$ is an alphabet.
$A$ is said to be deterministic if all its edges are in $Q \times (X \times M) \times Q$ and, for each pair
$(q, x) \in Q \times X$, there is at most one edge $(q, (x, m), p)$ for $m \in M$, $p \in Q$. The language
accepted by $A$ is defined as

\[ L(A) := \{w \in X^* \mid (w, 1) \in S(A)\}. \]
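To make the acceptance condition concrete, the following sketch simulates a valence automaton on a given input word. It rests on a simplifying assumption that is not part of the definition above: every edge reads exactly one letter, so that a run on a word has a fixed number of steps. The example automaton over the additive group of integers (a blind counter accepting the words a^n b^n) and all identifiers are hypothetical.

```python
def accepts(word, edges, initial, finals, mul, one):
    """Check whether a valence automaton accepts `word`.

    Simplifying assumption: each edge (p, x, m, q) reads exactly one letter x,
    so only finitely many configurations (state, monoid value) are reachable.
    """
    configs = {(initial, one)}
    for letter in word:
        configs = {(q, mul(value, m))
                   for (state, value) in configs
                   for (p, x, m, q) in edges
                   if p == state and x == letter}
    # Accept iff some run ends in a final state with the identity as its value.
    return any(q in finals and value == one for (q, value) in configs)

# Hypothetical example over (Z, +): state 0 reads a's and adds 1,
# state 1 reads b's and subtracts 1; both states are final.
edges = [(0, "a", +1, 0), (0, "b", -1, 1), (1, "b", -1, 1)]

def in_language(word):
    return accepts(word, edges, initial=0, finals={0, 1},
                   mul=lambda a, b: a + b, one=0)
```

For instance, `in_language("aabb")` holds because the counter returns to 0, while `in_language("aab")` fails because the run ends with value 1.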

A finite automaton is a valence automaton over $\mathbf{1}$. For a finite automaton
$A = (X^* \times \mathbf{1}, Q, E, q_0, F)$, we also write $A = (X, Q, E, q_0, F)$. Languages accepted by finite
automata are called regular languages. A valence transducer over $M$ is an automaton $A$
over $X^* \times Y^* \times M$, where $X$ and $Y$ are alphabets. The transduction performed by $A$ is

\[ T(A) := \{(x, y) \in X^* \times Y^* \mid (x, y, 1) \in S(A)\}. \]

A finite state transducer is a valence transducer over $\mathbf{1}$. For a finite state transducer
$A = (X^* \times Y^* \times \mathbf{1}, Q, E, q_0, F)$, we also write $A = (X, Y, Q, E, q_0, F)$. Transductions
performed by finite state transducers are called rational transductions.

A valence grammar over $M$ is a tuple $G = (N, T, M, P, S)$, where $N, T$ are disjoint
alphabets, called the nonterminal and terminal alphabet, respectively,
$P \subseteq N \times (N \cup T)^* \times M$ is a finite set of productions, and $S \in N$ is the start symbol. For a production
$(A, w, m) \in P$ we also write $(A \to w; m)$. The derivation relation $\Rightarrow_G$ of $G$ is a binary
relation on $(N \cup T)^* \times M$, for which $(u, a) \Rightarrow_G (v, b)$ iff there is a production $(A \to w; c) \in P$ and
words $r, s \in (N \cup T)^*$ such that $u = rAs$, $v = rws$, and $b = ac$. The language generated
by $G$ is defined as

\[ L(G) := \{w \in T^* \mid (S, 1) \Rightarrow_G^* (w, 1)\}. \]

Valence grammars were introduced by Păun in [Pau80]. A thorough treatment, including
normal form results and a classification of the resulting language classes for commutative
monoids, has been carried out by Fernau and Stiebe [FS02]. Valence grammars over $\mathbf{1}$
are called context-free grammars. For a context-free grammar $G = (N, T, \mathbf{1}, P, S)$, we
also write $G = (N, T, P, S)$. Furthermore, a production $(A \to w; 1) \in P$ in a context-free
grammar is also written $A \to w$. Languages generated by context-free grammars are
called context-free.

3 A dichotomy of monoids



An infinite ascending chain in $M$ is an infinite sequence $x_1, x_2, \ldots$ of pairwise distinct
elements of $M$ such that $x_i \sqsubseteq x_{i+1}$ for all $i \in \mathbb{N}$.

Lemma 1. Let M be left- or right-cancellative. Then exactly one of the following holds: 

1. M is a finite group. 

2. M contains an infinite ascending chain. 

Proof. Suppose M does not contain an infinite ascending chain. 

First, we prove that $M$ is a group. Let $r \in M$ and consider the elements $r^i \in M$,
$i \in \mathbb{N}$. Since $r^i \sqsubseteq r^j$ for $i \le j$, our assumption implies that there are $i, j \in \mathbb{N}$, $i < j$, with
$r^i = r^j$. Since $M$ is left- or right-cancellative, this implies $r^{j-i} = 1$, meaning that $r$ has
in $r^{j-i-1}$ a two-sided inverse. Thus, $M$ is a group.

This implies that $r \sqsubseteq s$ for any $r, s \in M$. Therefore, if $M$ were infinite, it would
contain an infinite ascending chain. □

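Lemma 2. Let $s, t \in M$, $s \ne t$, and $s \sqsubseteq t$. Then, $\overrightarrow{\mathfrak{I}}(s) \cap \overrightarrow{\mathfrak{I}}(t) = \emptyset$ and $\overleftarrow{\mathfrak{I}}(s) \cap \overleftarrow{\mathfrak{I}}(t) = \emptyset$.

Proof. We only show $\overrightarrow{\mathfrak{I}}(s) \cap \overrightarrow{\mathfrak{I}}(t) = \emptyset$, since then $\overleftarrow{\mathfrak{I}}(s) \cap \overleftarrow{\mathfrak{I}}(t) = \emptyset$ follows by applying
the former to the opposite monoid. Write $t = us$, $u \in M$, and suppose there were a
$z \in \overrightarrow{\mathfrak{I}}(s) \cap \overrightarrow{\mathfrak{I}}(t)$. Then, $1 = tz = usz = u$ and thus $t = s$. □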

Theorem 3. For every monoid M , exactly one of the following holds: 

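1. The subsets $\mathfrak{R}(M)$, $\mathfrak{L}(M)$, and $\mathfrak{E}(M)$ coincide and constitute a finite group.

2. $\mathfrak{R}(M)$ and $\mathfrak{L}(M)$ each contain an infinite ascending chain. In particular, there
exist infinite sets $S \subseteq \mathfrak{R}(M)$ and $S' \subseteq \mathfrak{L}(M)$ such that $\overrightarrow{\mathfrak{I}}(s) \cap \overrightarrow{\mathfrak{I}}(t) = \emptyset$ for
$s, t \in S$, $s \ne t$, and $\overleftarrow{\mathfrak{I}}(s') \cap \overleftarrow{\mathfrak{I}}(t') = \emptyset$ for $s', t' \in S'$, $s' \ne t'$.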

Proof. First, we claim that $\mathfrak{R}(M)$ is infinite if and only if $\mathfrak{L}(M)$ is infinite. Here, it
suffices to show that $\mathfrak{R}(M)$ being infinite implies the infinity of $\mathfrak{L}(M)$, since the other direction
follows by considering the opposite monoid. If $\mathfrak{R}(M)$ is infinite, it contains an infinite
ascending chain according to Lemma 1, because $\mathfrak{R}(M)$ is right-cancellative. By Lemma 2, the
elements of the chain have pairwise disjoint sets of right inverses, which are non-empty. Since
right inverses are left invertible, $\mathfrak{L}(M)$ is infinite.

Suppose $\mathfrak{R}(M)$ and $\mathfrak{L}(M)$ are both finite. Since $\mathfrak{R}(M)$ is right-cancellative, it is a
group by Lemma 1. Thus, we have $\mathfrak{R}(M) \subseteq \mathfrak{L}(M)$ and analogously $\mathfrak{L}(M) \subseteq \mathfrak{R}(M)$.
In order to prove $\mathfrak{E}(M) = \mathfrak{R}(M)$, we observe that $\mathfrak{R}(M) \subseteq \mathfrak{E}(M)$ by definition. Now,
suppose $a \in \mathfrak{E}(M)$ to be witnessed by $bac = 1$, $b, c \in M$. By this equation, we have
$b \in \mathfrak{R}(M)$ and can multiply $b^{-1}$ on the left and then $b$ on the right. We obtain $acb = 1$
and thus $a \in \mathfrak{R}(M)$. This proves that $\mathfrak{R}(M) = \mathfrak{L}(M) = \mathfrak{E}(M)$ and that this is a finite
group.

In case $\mathfrak{R}(M)$ and $\mathfrak{L}(M)$ are both infinite, the infinite ascending chains are provided
by Lemma 1. By Lemma 2, their elements form sets $S \subseteq \mathfrak{R}(M)$ and $S' \subseteq \mathfrak{L}(M)$ with
the desired properties. □
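The second case of the dichotomy can be observed in the bicyclic monoid, which is generated by p and q subject to pq = 1. The sketch below is an illustration under stated assumptions: elements q^i p^j are encoded as pairs (i, j), the product formula is the standard normal-form multiplication, and right inverses are searched only inside a finite window, so the code demonstrates the phenomenon rather than proving it.

```python
def mul(a, b):
    """Product in the bicyclic monoid; the pair (i, j) stands for q^i p^j, with pq = 1."""
    (i, j), (k, l) = a, b
    t = min(j, k)  # number of cancellations p q -> 1 in the middle of the word
    return (i + k - t, j + l - t)

ONE = (0, 0)
BOUND = 6  # size of the finite search window (an arbitrary illustrative choice)
window = [(i, j) for i in range(BOUND) for j in range(BOUND)]

def right_inverses(a):
    """Right inverses of `a` that lie inside the search window."""
    return {b for b in window if mul(a, b) == ONE}

# The powers p, p^2, p^3, ... form an ascending chain: p^(j+1) = p^j p = p p^j.
p = (0, 1)
chain = [(0, j) for j in range(1, BOUND)]
is_chain = all(mul(x, p) == y and mul(p, x) == y for x, y in zip(chain, chain[1:]))

# Each p^j is right invertible; its only right inverse found is q^j = (j, 0).
inverse_sets = [right_inverses(x) for x in chain]
```

The computed sets of right inverses are non-empty and pairwise disjoint, matching the properties of the set S in the theorem.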






4 Capabilities of valence automata and transducers 



In this section, we show that the following conditions are equivalent:

• Valence automata over M accept only regular languages. 

• Valence automata over M can be determinized. 

• Valence transducers over M perform only rational transductions. 

• $\mathfrak{R}(N)$ is finite for every finitely generated submonoid $N$ of $M$.

Lemma 4. Let $\mathfrak{R}(N)$ be finite for every finitely generated submonoid $N$ of $M$. Then,
valence automata over M accept only regular languages and valence transducers over 
M perform only rational transductions. In particular, valence automata over M can be 
determinized. 

Proof. Let $A = (X^* \times M, Q, E, q_0, F)$ be a valence automaton over $M$. Since $E$ is finite,
the set of $m \in M$ such that there is some edge $(p, (w, m), q)$ in $E$ is finite. If $N$ is the
submonoid of $M$ generated by these $m \in M$, we can regard $A$ as a valence automaton
over $N$. Thus, let $A = (X^* \times N, Q, E, q_0, F)$. Furthermore, removing edges of the
form $(p, (w, m), q)$ such that $m \notin \mathfrak{E}(N)$ will not alter the accepted language, since such
edges cannot be used in a successful run. Since $\mathfrak{R}(N)$ is finite, Theorem 3 implies that
$\mathfrak{E}(N)$ is a finite group and we can assume $A = (X^* \times \mathfrak{E}(N), Q, E, q_0, F)$. Since $\mathfrak{E}(N)$ is finite,
a finite automaton accepting $L(A)$ can now easily be constructed by incorporating the monoid
elements into the states. The proof for the valence transducers over $M$ works completely analogously.

Since finite automata can be determinized and we have seen that valence automata 
over M accept only regular languages, it follows that valence automata over M can be 
determinized. □ 

In [MS01], Mitrana and Stiebe proved that valence automata over groups with at least
one element of infinite order cannot be determinized. We can now use a similar idea 
and our dichotomy theorem to provide a characterization of those monoids over which 
valence automata can be determinized. 

Lemma 5. Let $M$ be a finitely generated monoid such that $\mathfrak{R}(M)$ is infinite. Then,
there is a valence automaton over M whose accepted language cannot be accepted by a 
deterministic valence automaton over M . In particular, valence automata over M can 
accept non-regular languages and valence transducers over M can perform non-rational 
transductions. 

Proof. Let $M$ be generated by the finite set $\{a_1, \ldots, a_n\}$ and let $X = \{x_1, \ldots, x_n\}$,
$Y = \{y_1, \ldots, y_n\}$ be disjoint alphabets. Let $\varphi : (X \cup Y)^* \to M$ be the epimorphism
defined by $\varphi(x_i) := \varphi(y_i) := a_i$ and $K := X^* \cup \{w \in X^*Y^* \mid \varphi(w) = 1\}$. Then,
$K$ is clearly accepted by a (nondeterministic) valence automaton over $M$. Suppose $K$
were accepted by a deterministic valence automaton $A$ over $M$. Let $S \subseteq \mathfrak{R}(M)$ be
the infinite set provided by Theorem 3. The infinity of $S$ implies that we can find an
infinite set $S' \subseteq X^*$ such that $\varphi(S') = S$ and $\varphi(u) \ne \varphi(v)$ for $u, v \in S'$, $u \ne v$. Since
$A$ is deterministic and $S' \subseteq L(A)$, each word $w \in S'$ causes $A$ to enter a configuration
$(q(w), 1)$, where $q(w)$ is a final state. Choose $u, v \in S'$ such that $u \ne v$ and $q(u) = q(v)$.
Let $u' \in Y^*$ be a word such that $\varphi(u)\varphi(u') = 1$; this is possible since $\varphi(u) \in \mathfrak{R}(M)$ and
$\varphi$ is surjective. The word $u'$ causes $A$ to go from $(q(u), 1) = (q(v), 1)$ to $(q, 1)$ for some
final state $q$, since $uu' \in K$. Thus, $vu'$ is also contained in $K$ and hence $\varphi(v)\varphi(u') = 1$,
but $\overrightarrow{\mathfrak{I}}(\varphi(u)) \cap \overrightarrow{\mathfrak{I}}(\varphi(v)) = \emptyset$, a contradiction.

Thus, $K$ is not accepted by a deterministic valence automaton over $M$. In particular,
$K$ is not regular. Furthermore, from the valence automaton accepting $K$, a valence
transducer can be constructed that maps $\{\lambda\}$ to $K$. Since $K$ is not regular, the transduction
performed by the transducer is not rational. □



5 Capabilities of valence grammars 

In this section, it is shown that the following conditions are equivalent: 

1. Valence grammars over M generate only context-free languages. 

2. $\mathfrak{R}(N)$ is finite for every finitely generated submonoid $N$ of $M$.

In one of the directions, we have to construct a context-free grammar for valence gram- 
mars over monoids that fulfill the second condition. Because of the limited means avail- 
able in the context-free case, the constructed grammar can simulate only a certain frag- 
ment of the derivations in the valence grammar. Thus, we will have to make sure that 
every word generated by the valence grammar has a derivation in the aforementioned 
fragment. These derivations are obtained by considering the derivation tree of a given 
derivation and then choosing a suitable linear extension of the tree order. The construc- 
tion of these linear extensions can already be described for a simpler kind of partial 
order, valence trees. 

Let $X$ be an alphabet and $U \subseteq X$ a subset. Then, each word $w \in X^*$ has a unique
decomposition $w = y_0 x_1 y_1 \cdots x_n y_n$ such that $y_0, y_n \in (X \setminus U)^*$, $y_i \in (X \setminus U)^+$ for
$1 \le i \le n - 1$ and $x_i \in U^+$ for $1 \le i \le n$. This decomposition is called the $U$-decomposition
of $w$ and we define $\rho(w, U) := n$.
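The decomposition simply cuts a word into maximal blocks of letters from U and the (possibly empty at both ends) pieces between them. A minimal sketch, with words represented as Python strings and the function names `rho` and `u_decomposition` chosen here for illustration:

```python
def rho(word, U):
    """Number of maximal nonempty blocks of letters from U in `word`."""
    blocks, inside = 0, False
    for letter in word:
        if letter in U and not inside:
            blocks += 1
        inside = letter in U
    return blocks

def u_decomposition(word, U):
    """Return ([y_0, ..., y_n], [x_1, ..., x_n]) with word = y_0 x_1 y_1 ... x_n y_n."""
    ys, xs = [""], []
    for letter in word:
        if letter in U:
            if len(xs) < len(ys):   # a new block x_i starts here
                xs.append("")
            xs[-1] += letter
        else:
            if len(xs) == len(ys):  # the block x_i has ended, so y_i starts
                ys.append("")
            ys[-1] += letter
    if len(xs) == len(ys):          # the word ends inside a block: y_n is empty
        ys.append("")
    return ys, xs
```

For the word `cabcac` and U = {a, b}, the blocks are `ab` and `a`, separated by single occurrences of `c`, so the value of rho is 2.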

A tree is a finite partially ordered set $(T, \le)$ that has a least element and where, for
each $t \in T$, the set $\{t' \in T \mid t' \le t\}$ is totally ordered by $\le$. The least element is also
called the root and the maximal elements are called leaves. A valence tree $\mathcal{T}$ over $M$ is
a tuple $(T, \le, \varphi)$, where $(T, \le)$ is a tree and $\varphi : T^* \to M$ is a homomorphism¹ assigning
a valence to each node. An evaluation defines an order in which the nodes in a valence
tree can be traversed that is compatible with the tree order. Thus, an evaluation of $\mathcal{T}$
is a linear extension $\preceq$ of $(T, \le)$. Let $w \in T^*$ correspond to $\preceq$, i.e., let $T = \{t_1, \ldots, t_n\}$
such that $t_1 \preceq \cdots \preceq t_n$ and $w = t_1 \cdots t_n$. Then the value of $\preceq$ is defined to be $\varphi(w)$.
An element $v \in M$ is called a value of $\mathcal{T}$ if there exists an evaluation of $(T, \le)$ with
value $v$. Given a node $t \in T$, let $U_t := \{t' \in T \mid t \le t'\}$. If $w = y_0 x_1 y_1 \cdots x_n y_n$ is the
$U_t$-decomposition of $w$, then $\varphi(x_1), \ldots, \varphi(x_n)$ is called the valence sequence of $t$ in $w$
and $n$ its length. By the excursiveness of an evaluation, we refer to the maximal length
of a valence sequence. Hence, the excursiveness of an evaluation is the maximal number
of times one has to enter any given subtree when traversing the nodes in the order given
by the evaluation. We are interested in finding evaluations of valence trees with small
excursiveness. Of course, for every valence tree, there are evaluations with excursiveness
one (take, for example, the order induced by a preorder traversal), but these might not
be able to cover all possible values. However, we will see in Lemma 8 that, in the case
of a finite group, there exists a bound $m$ such that every value can be attained by an
evaluation of excursiveness at most $m$.

¹ We will often assume, without loss of generality, that $T$ is an alphabet.
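The notions of evaluation and excursiveness can be illustrated on a small tree. In the sketch below, a tree is given by a hypothetical parent map (the root has parent `None`), an evaluation is a list of nodes, and the excursiveness is computed as the maximal number of blocks that any subtree forms in that list. The example tree and all identifiers are assumptions made for illustration; valences are left out because excursiveness depends only on the order.

```python
def descendants(parent, t):
    """U_t: all nodes t' with t <= t' in the tree order (t itself included)."""
    result = set()
    for node in parent:
        cur = node
        while cur is not None:
            if cur == t:
                result.add(node)
                break
            cur = parent[cur]
    return result

def is_evaluation(parent, order):
    """A linear extension lists every node exactly once, each after its parent."""
    position = {node: i for i, node in enumerate(order)}
    return (sorted(order) == sorted(parent)
            and all(p is None or position[p] < position[n]
                    for n, p in parent.items()))

def excursiveness(parent, order):
    """Maximal number of maximal blocks that any subtree U_t forms in `order`."""
    def blocks(U):
        count, inside = 0, False
        for node in order:
            if node in U and not inside:
                count += 1
            inside = node in U
        return count
    return max(blocks(descendants(parent, t)) for t in parent)

# Hypothetical tree: root 0 with children 1 and 2; node 3 below 1, node 4 below 2.
parent = {0: None, 1: 0, 2: 0, 3: 1, 4: 2}
preorder = [0, 1, 3, 2, 4]      # visits each subtree in one go
interleaved = [0, 1, 2, 3, 4]   # leaves the subtree of node 1 and re-enters it
```

Here the preorder traversal has excursiveness one, whereas the interleaved order enters the subtrees below nodes 1 and 2 twice each.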

Lemma 6. For each finite group $G$, there is a constant $m \in \mathbb{N}$ with the following
property: For elements $g_i, h_i \in G$, $i = 1, \ldots, n$, $n \ge m$, there are indices $k, \ell \in \mathbb{N}$,
$1 \le k < \ell \le n$, such that $g_k h_k \cdots g_\ell h_\ell = g_k \cdots g_\ell h_k \cdots h_\ell$.

Proof. Let $m = 2(|G|^3 + 1)$ and let $D \subseteq \{1, \ldots, m\}$ be the set of odd indices. Define
the map $\alpha : D \to G^3$ by $\alpha(i) := (g_1 \cdots g_i, \; h_1 \cdots h_i, \; g_1 h_1 \cdots g_i h_i)$ for $i \in D$. Since
$|D| \ge |G|^3 + 1$, there are indices $i, j \in D$, $i < j$, such that $\alpha(i) = \alpha(j)$. This means that
$g_{i+1} \cdots g_j = 1$, $h_{i+1} \cdots h_j = 1$, and $g_{i+1} h_{i+1} \cdots g_j h_j = 1$. Since $i, j$ are both odd, letting
$k = i + 1$ and $\ell = j$ implies $k < \ell$ and yields the desired equality. □
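The pigeonhole argument translates directly into a search procedure: record the triple of prefix products at every odd index and stop at the first repetition. The following sketch runs it on the symmetric group on three letters; the encoding of permutations as tuples, the random test data, and all identifiers are assumptions made for illustration.

```python
import random
from itertools import permutations

def compose(f, g):
    """Product fg of two permutations given as tuples: apply f first, then g."""
    return tuple(g[f[i]] for i in range(len(f)))

def prod(elems, identity):
    """Product of a list of group elements, taken from left to right."""
    result = identity
    for e in elems:
        result = compose(result, e)
    return result

def find_indices(gs, hs, identity):
    """Pigeonhole search on the triples of prefix products at odd indices.

    gs and hs are 0-based lists; the returned pair (k, l) is 1-based.
    Returns None if no repetition occurs (possible only for short inputs).
    """
    seen = {}
    pg = ph = pgh = identity
    for i in range(1, len(gs) + 1):
        g, h = gs[i - 1], hs[i - 1]
        pg = compose(pg, g)                # g_1 ... g_i
        ph = compose(ph, h)                # h_1 ... h_i
        pgh = compose(compose(pgh, g), h)  # g_1 h_1 ... g_i h_i
        if i % 2 == 1:                     # only odd indices are recorded
            key = (pg, ph, pgh)
            if key in seen:
                return seen[key] + 1, i
            seen[key] = i
    return None

# Hypothetical test data: random elements of the symmetric group S_3.
random.seed(0)
G = list(permutations(range(3)))
identity = (0, 1, 2)
m = 2 * (len(G) ** 3 + 1)
gs = [random.choice(G) for _ in range(m)]
hs = [random.choice(G) for _ in range(m)]

k, l = find_indices(gs, hs, identity)
alternating = [x for pair in zip(gs[k - 1:l], hs[k - 1:l]) for x in pair]
lhs = prod(alternating, identity)  # g_k h_k ... g_l h_l
rhs = compose(prod(gs[k - 1:l], identity), prod(hs[k - 1:l], identity))
```

Since there are 217 odd indices below 434 but only 216 possible triples, a repetition must occur, and both `lhs` and `rhs` then evaluate to the identity.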

Lemma 7. Let $X$ be an alphabet and $U, V \subseteq X$ subsets such that either $U \subseteq V$, $V \subseteq U$,
or $U \cap V = \emptyset$. Furthermore, let $r \in X^*U$, $x \in U^+$, $y \in (X \setminus U)^+$ and $s \in X^* \setminus UX^*$.
Then, we have $\rho(rxys, V) \le \rho(ryxs, V)$.

Proof. Suppose $V \subseteq U$. Since $y$ does not contain any symbols in $V$, we have

\[ \rho(ryxs, V) = \rho(r, V) + \rho(x, V) + \rho(s, V), \]
\[ \rho(rxys, V) = \rho(rx, V) + \rho(s, V). \]

Thus,

\[ \rho(rxys, V) = \rho(rx, V) + \rho(s, V) \le \rho(r, V) + \rho(x, V) + \rho(s, V) = \rho(ryxs, V). \]

In the case $U \cap V = \emptyset$, $x$ does not contain any symbol in $V$. Hence,

\[ \rho(ryxs, V) = \rho(r, V) + \rho(y, V) + \rho(s, V), \]
\[ \rho(rxys, V) = \rho(r, V) + \rho(ys, V), \]

which implies

\[ \rho(rxys, V) = \rho(r, V) + \rho(ys, V) \le \rho(r, V) + \rho(y, V) + \rho(s, V) = \rho(ryxs, V). \]

Now suppose $U \subseteq V$. Since the rightmost letter of $r$ is in $V$ and $x$ lies in $V^+$, we have
$\rho(rxys, V) = \rho(rys, V)$. Thus, $\rho(rxys, V) = \rho(rys, V) \le \rho(ryxs, V)$. □
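The inequality can be checked exhaustively for short words, which also shows that the condition on U and V cannot simply be dropped. The sketch below is a brute-force test over a three-letter alphabet, not a proof; the function names and the length bound are illustrative assumptions.

```python
from itertools import product

def rho(word, V):
    """Number of maximal nonempty blocks of letters from V in `word`."""
    count, inside = 0, False
    for letter in word:
        if letter in V and not inside:
            count += 1
        inside = letter in V
    return count

def words(alphabet, max_len):
    """All words over `alphabet` of length at most `max_len`."""
    return ["".join(t) for n in range(max_len + 1)
            for t in product(alphabet, repeat=n)]

def inequality_holds(X, U, V, max_len=2):
    """Test rho(rxys, V) <= rho(ryxs, V) for all admissible short r, x, y, s."""
    ws = words(X, max_len)
    rs = [w for w in ws if w and w[-1] in U]                  # r in X*U
    xs = [w for w in ws if w and all(c in U for c in w)]      # x in U+
    ys = [w for w in ws if w and all(c not in U for c in w)]  # y in (X \ U)+
    ss = [w for w in ws if not (w and w[0] in U)]             # s in X* \ UX*
    return all(rho(r + x + y + s, V) <= rho(r + y + x + s, V)
               for r, x, y, s in product(rs, xs, ys, ss))
```

With U = {a, b} and V = {b, c}, which are neither nested nor disjoint, the choice r = b, x = a, y = c and empty s gives two V-blocks in `bac` but only one in `bca`, so the inequality fails there.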

Lemma 8. For each finite group $G$, there is a constant $m$ such that each value of a
valence tree over $G$ has an evaluation of excursiveness at most $m$.

Proof. For an alphabet $X$, we denote the set of multisets over $X$, i.e., maps $X \to \mathbb{N}$, by
$X^{\oplus}$. $X^{\oplus}$ carries a (commutative) monoid structure by way of $(\alpha + \beta)(x) := \alpha(x) + \beta(x)$
for $x \in X$. In this monoid, $\alpha \sqsubseteq \beta$ means that $\alpha(x) \le \beta(x)$ for every $x \in X$. To every
evaluation $w$ of $(T, \le)$, we assign the multiset $\mu_w \in T^{\oplus}$, which is
defined by $\mu_w(t) := \rho(w, U_t)$ for every $t \in T$. That is, $\mu_w(t)$ is the length of the valence
sequence of $t$ in $w$.

Let $m$ be the constant provided by Lemma 6, let $v$ be a value of the valence tree, and let
$w \in T^*$ be an evaluation of $(T, \le)$ such that $\mu_w$ is minimal with respect to $\sqsubseteq$ among all
evaluations with value $v$. If we can prove that $\mu_w(t) \le m$ for all $t \in T$, the lemma follows.
Therefore, suppose that there is a $t \in T$ with $n := \mu_w(t) > m$. Specifically, let
$w = y_0 x_1 y_1 \cdots x_n y_n$ be the $U_t$-decomposition of $w$. Use Lemma 6 to find indices $1 \le k < \ell \le n$ with

\[ \varphi(x_k)\varphi(y_k) \cdots \varphi(x_\ell)\varphi(y_\ell) = \varphi(x_k) \cdots \varphi(x_\ell)\varphi(y_k) \cdots \varphi(y_\ell). \qquad (1) \]

Furthermore, let

\[ w' := (y_0 x_1 y_1 \cdots x_{k-1} y_{k-1})(x_k \cdots x_\ell \, y_k \cdots y_\ell)(x_{\ell+1} y_{\ell+1} \cdots x_n y_n). \qquad (2) \]

That is, we obtain $w'$ from $w$ by replacing $x_k y_k \cdots x_\ell y_\ell$ with $x_k \cdots x_\ell y_k \cdots y_\ell$. Then, (1)
means that $\varphi(w') = \varphi(w)$. We shall prove that $w'$ is an evaluation of $(T, \le)$ and obeys
$\mu_{w'} \sqsubset \mu_w$, which contradicts the choice of $w$.

First, we prove that $w'$ is an evaluation. Let $u_1, u_2 \in T$ be nodes with $u_1 < u_2$. If
$u_1 < t$, then $u_1$ appears in $y_0$, and thus $u_2$ is on the right side of $u_1$ in $w'$. If $u_1 \ge t$, then
each of the nodes $u_1, u_2$ appears in some $x_i$ and therefore they do not change their relative
positions. If $u_1$ and $t$ are incomparable, then $u_2$ and $t$ are also incomparable and each
of $u_1, u_2$ appears in some $y_i$. Again, $u_1$ and $u_2$ do not change their relative positions.
Thus, $w'$ corresponds to a linear extension of $\le$.

We want to show that $\mu_{w'} \sqsubseteq \mu_w$. To this end, we consider the words

\[ w_i := (y_0 x_1 y_1 \cdots x_{k-1} y_{k-1})(x_k \cdots x_{k+i} \, y_k \cdots y_{k+i})(x_{k+i+1} y_{k+i+1} \cdots x_n y_n) \]

for $0 \le i \le \ell - k$. With these, we have $w = w_0$ and $w' = w_{\ell-k}$. Since $(T, \le)$ is a tree,
we have $U_u \subseteq U_t$, $U_t \subseteq U_u$, or $U_u \cap U_t = \emptyset$ for every $u \in T$. Therefore, we can apply
Lemma 7 to $U := U_t$, $V := U_u$, and

\[ r := (y_0 x_1 y_1 \cdots x_{k-1} y_{k-1})(x_k \cdots x_{k+i}), \qquad x := x_{k+i+1}, \]
\[ y := y_k \cdots y_{k+i}, \qquad s := y_{k+i+1}(x_{k+i+2} y_{k+i+2} \cdots x_n y_n), \]

which yields $\rho(w_{i+1}, U_u) \le \rho(w_i, U_u)$ for $0 \le i < \ell - k$. This implies $\mu_{w'}(u) \le \mu_w(u)$ and
therefore $\mu_{w'} \sqsubseteq \mu_w$.

It remains to be shown that $\mu_{w'}$ is strictly smaller than $\mu_w$. In $w'$, the node $t$ has the
valence sequence

\[ \varphi(x_1), \ldots, \varphi(x_{k-1}), \varphi(x_k \cdots x_\ell), \varphi(x_{\ell+1}), \ldots, \varphi(x_n), \]

which has length $\mu_{w'}(t) = n - (\ell - k) < n = \mu_w(t)$. □

We define a derivation tree for a valence grammar $G = (N, T, M, P, S)$ to be a tuple
$(T, \le, \varphi, (\le_t)_{t \in T}, \Lambda)$, where

• $(T, \le, \varphi)$ is a valence tree,

• for each $t \in T$, $\le_t$ is a total order on the set of successors of $t$,

• $\Lambda : T \to N \cup T \cup \{\lambda\}$ defines a label for each node,

• if $t \in T$ is a node with the successors $s_1, \ldots, s_n$ such that $s_1 \le_t \cdots \le_t s_n$, then we
either have $\Lambda(t) \in T \cup \{\lambda\}$, $n = 0$, and $\varphi(t) = 1$ or we have $\Lambda(t) \in N$ and there is
a production $(\Lambda(t) \to \Lambda(s_1) \cdots \Lambda(s_n); \varphi(t))$ in $P$.

The total orders $\le_t$, $t \in T$, induce a total order on the set of leaves (see [HU79, Section
4.3] for details), which in turn defines a word $w \in T^*$. This word is called the yield of
the derivation tree.

Each derivation tree can be regarded as a valence tree. An evaluation then defines a
derivation $(A, 1) \Rightarrow_G^* (w, v)$, where $A \in N$ is the label of the root, $w$ is the yield, and
$v \in M$ is the value of the evaluation. Conversely, every derivation induces a derivation
tree and an evaluation. Thus, a word $w \in T^*$ is in $L(G)$ iff there exists a derivation tree
for $G$ with yield $w$, a root labeled $S$, and an evaluation with value 1. See [FS02, Section
4.2] for details.

Lemma 9. Let $\mathfrak{R}(N)$ be finite for every finitely generated submonoid $N$ of $M$. Furthermore,
let $G = (N, T, M, P, S)$ be a valence grammar over $M$. Then, $L(G)$ is context-free.

Proof. As in the proof of Lemma 4, we can assume that $M$ is finitely generated and
thus has a finite $\mathfrak{R}(M)$. Since productions $(A \to w; m)$ with $m \notin \mathfrak{E}(M)$ cannot be
part of a successful derivation, their removal does not change the generated language.
Furthermore, by Theorem 3, $\mathfrak{E}(M)$ is a finite group. Thus, we can assume that
$G = (N, T, H, P, S)$, where $H = \mathfrak{E}(M)$ is a finite group. By a simple construction, we can
further assume that in $G$, every production is of the form $(A \to w; h)$ with $w \in N^*$ or
$(A \to w; 1)$ with $w \in T \cup \{\lambda\}$.

We shall construct a context-free grammar $G' = (N', T, P', S')$ for $L(G)$. The basic
idea is that $G'$ will simulate derivations of bounded excursiveness. This is done by
letting the nonterminals in $G'$ consist of a nonterminal $A \in N$ and a finite sequence $\sigma$
of elements from $H$. $G'$ then simulates the generation of a nonterminal $A$ by generating
a pair $(A, \sigma)$ and thereby guesses that the corresponding node in the derivation tree of
$G$ will have $\sigma$ as its valence sequence. Lemma 8 will then guarantee that this allows $G'$
to derive all words in $L(G)$ when the sequences $\sigma$ are of bounded length.

Formally, we will regard $H$ as an alphabet and a sequence will be a word over $H$. In
order to be able to distinguish between the concatenation of words in $H^*$ and the group
operation in $H$, we will denote the concatenation in $H^*$ by $\square$. Thus, let $N' = N \times H^{\le m}$,
in which $m \in \mathbb{N}$ is the constant provided by Lemma 8 for the group $H$. The set of
sequences that can be obtained from another sequence $\sigma$ by "joining" subsequences is
denoted by $J(\sigma)$:

\[ J(h_1 \square h_2 \square \sigma) := J((h_1 h_2) \square \sigma) \cup \{h_1 \square \sigma' \mid \sigma' \in J(h_2 \square \sigma)\} \]

for $h_1, h_2 \in H$ and $\sigma \in H^*$ and $J(\sigma) := \{\sigma\}$ if $|\sigma| \le 1$. $J$ is defined for subsets $S \subseteq H^*$
by $J(S) := \bigcup_{\sigma \in S} J(\sigma)$.
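The recursive definition of the joining operation can be transcribed almost literally. In the sketch below, sequences are Python tuples, and the group is passed as a multiplication function; the cyclic group of order 5, written additively, serves as a hypothetical example, and the name `joins` is chosen here for illustration.

```python
def joins(seq, mul):
    """All sequences obtainable from `seq` by multiplying adjacent entries together."""
    if len(seq) <= 1:
        return {seq}
    h1, h2, rest = seq[0], seq[1], seq[2:]
    # Either the first two entries are joined into their product ...
    merged = joins((mul(h1, h2),) + rest, mul)
    # ... or the first entry is kept and joining continues behind it.
    kept = {(h1,) + tail for tail in joins((h2,) + rest, mul)}
    return merged | kept

# Hypothetical example group: integers modulo 5 under addition.
add_mod5 = lambda a, b: (a + b) % 5
result = joins((1, 2, 3), add_mod5)
```

For the sequence (1, 2, 3), the result consists of (1, 2, 3), (3, 3), (1, 0), and (1,), one sequence for each way of cutting the original into consecutive groups.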

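For each production $(A \to w; h) \in P$, $w = B_1 \cdots B_n$, $B_i \in N$ for $1 \le i \le n$, we include
the production

\[ (A, \sigma) \to (B_1, \sigma_1) \cdots (B_n, \sigma_n) \]

for each $\sigma \in H^{\le m} \setminus \{\lambda\}$ and $\sigma_1, \ldots, \sigma_n \in H^{\le m}$ such that for $\sigma = h_1 \square \sigma'$, $h_1 \in H$,
$\sigma' \in H^{\le m-1}$, one of the following holds:

• $(h^{-1} h_1) \square \sigma' \in J(\sigma_1 \sqcup\!\sqcup \cdots \sqcup\!\sqcup \sigma_n)$.

• $h_1 = h$ and $\sigma' \in J(\sigma_1 \sqcup\!\sqcup \cdots \sqcup\!\sqcup \sigma_n)$.

Furthermore, for every production $(A \to w; 1)$, $w \in T \cup \{\lambda\}$, we include $(A, 1) \to w$.
Finally, the start symbol of $G'$ is $(S, 1)$.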

It remains to be shown that $L(G') = L(G)$. In order to prove $L(G') \subseteq L(G)$, one can
show by induction on $n$ that for $w \in T^*$, $(A, \sigma) \Rightarrow_{G'}^n w$ implies that there is a derivation
$(A, 1) \Rightarrow_G^* (w, h)$ for some $h \in H$ using productions $(A_1 \to w_1; h_1), \ldots, (A_k \to w_k; h_k)$
such that $\sigma \in J(h_1 \square \cdots \square h_k)$. This implies that for $(S, 1) \Rightarrow_{G'}^* w$, $w \in T^*$, we have
$w \in L(G)$. Thus, $L(G') \subseteq L(G)$.

Let $w \in L(G)$ with derivation tree $(T, \le, \varphi, (\le_t)_{t \in T}, \Lambda)$. By Lemma 8, there is an
evaluation $\preceq$ of the tree of excursiveness $\le m$. From the tree and the evaluation, we
construct a derivation tree $(T, \le, \varphi', (\le_t)_{t \in T}, \Lambda')$ for $w$ in $G'$ as follows. The components
$T$, $\le$, and $\le_t$, $t \in T$, stay unaltered, but $\varphi'$ will assign 1 to each node and $\Lambda'$ is defined
by $\Lambda'(t) := \Lambda(t)$ if $\Lambda(t) \in T \cup \{\lambda\}$ and $\Lambda'(t) := (\Lambda(t), h_1 \square \cdots \square h_k)$ if $\Lambda(t) \in N$, where
$h_1, \ldots, h_k$ is the valence sequence of $t$ in $\preceq$. Now, one can see that the new tree is a
derivation tree for $G'$ that generates $w$ with any evaluation. Hence, $L(G) \subseteq L(G')$. □

In order to prove the main result of this section, we need to exhibit a valence grammar
over $M$ that generates a non-context-free language when given a finitely generated
monoid $M$ with infinite $\mathfrak{R}(M)$. In the proof that the generated language is not context-free,
we will use the following well-known Iteration Lemma by Ogden [Ogd68].

Lemma 10 (Ogden). For each context-free language L, there is an integer m such that 
for any word $z \in L$ and any choice of at least $m$ distinct marked positions in $z$, there is
a decomposition z = uvwxy such that: 

1. w contains at least one marked position. 

2. Either u and v both contain marked positions, or x and y both contain marked 
positions. 

3. vwx contains at most m marked positions. 

4. $uv^i w x^i y \in L$ for every $i \ge 0$.

Lemma 11. Let $\mathfrak{R}(M)$ be infinite for some finitely generated monoid $M$. Then, there
is a valence grammar over M that generates a language that is not context-free. 

Proof. Let $M$ be generated by $a_1, \ldots, a_n$ and let $X = \{x_1, \ldots, x_n\}$ be an alphabet.
Furthermore, let $\varphi : X^* \to M$ be the surjective homomorphism defined by $\varphi(x_i) = a_i$.
The valence grammar $G = (N, T, M, P, S_0)$ is defined as follows. Let $N = \{S_0, S_1\}$,
$T = X \cup \{c\}$, and let $P$ consist of the productions

\[ (S_0 \to x_i S_0 x_i; a_i), \quad (S_0 \to c S_1 c; 1), \quad (S_1 \to x_i S_1; a_i), \quad (S_1 \to \lambda; 1) \]

for $1 \le i \le n$. Then, clearly $L(G) = K := \{r c s c r^{\mathrm{rev}} \mid r, s \in X^*, \varphi(rs) = 1\}$. It
remains to be shown that $K$ is not context-free. Suppose $K$ is context-free and let $m$
be the constant provided by Lemma 10. By Theorem 3, we can find an infinite subset
$S \subseteq \mathfrak{L}(M)$ such that $\overleftarrow{\mathfrak{I}}(a) \cap \overleftarrow{\mathfrak{I}}(b) = \emptyset$ for $a, b \in S$, $a \ne b$. Since $\varphi$ is surjective, we
can define $\ell(a)$ for every $a \in S$ to be the minimal length of a word $w \in X^*$ such that
$\varphi(w)a = 1$. If $\ell(a) < m$ for all $a \in S$, the finite set $\{\varphi(w) \mid w \in X^*, |w| < m\}$ contains
a left inverse for every $a \in S$. This, however, contradicts the fact that the infinitely
many elements of $S$ have disjoint sets of left inverses. Thus, there exists an $a \in S$ with
$\ell(a) \ge m$. We choose words $r, s \in X^*$ such that $\varphi(s) = a$ and $r$ is of minimal length
among those words satisfying $\varphi(rs) = 1$. Then, by the choice of $a$, we have $|r| \ge m$.

We apply the Iteration Lemma to the word $z = r c s c r^{\mathrm{rev}} \in K$, where we choose the
first $|r|$ symbols to be marked. Let $z = uvwxy$ be the decomposition from the lemma.
Condition 1 implies $|uv| < |r|$. Because of condition 4, $x$ cannot contain a $c$. Furthermore, $x$
cannot be a subword of $r$, since then pumping would lead to words with mismatching
first and third segment. In particular, from condition 2, the first part holds and $v$ is not
empty. Thus, if $x$ were a subword of $s$, pumping would again lead to a mismatching first
and third segment. Hence, $x$ is a subword of $r^{\mathrm{rev}}$. If we now pump with $i = 0$, we obtain
a word $r' c s c r'' \in K$, where $|r'| < |r|$. In particular, we have $\varphi(r's) = 1$, in contradiction
to the choice of $r$. □

Theorem 12. Let M be a monoid. The following conditions are equivalent:

1. Valence grammars over M generate only context-free languages. 

2. Valence automata over M accept only regular languages. 

3. Valence automata over M can be determinized.

4. Valence transducers over M perform only rational transductions. 

5. $\mathfrak{R}(N)$ is finite for every finitely generated submonoid $N$ of $M$.

6. $\mathfrak{L}(N)$ is finite for every finitely generated submonoid $N$ of $M$.

7. $\mathfrak{E}(N)$ is finite for every finitely generated submonoid $N$ of $M$.

Proof. Theorem 3 immediately implies that 5, 6, and 7 are equivalent. 1 is equivalent
to 5 by Lemma 11 and Lemma 9. Lemmas 4 and 5 prove that 2, 3, and 4 are each
equivalent to 5. □